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Abstract 

> 

VjQ ■ We consider the problem of minimizing the number of triangles in a graph of given order 

and size and describe the asymptotic structure of extremal graphs. This is achieved by 

oo 

\ characterizing the set of flag algebra homomorphisms that minimize the triangle density. 

Os| ■ 1 Introduction 

The famous theorem of Turan |Tur41j determines ex(n, K r ), the maximum number of edges in 
^ a graph with n vertices that does not contain the r-clique K r (see also Mantel |Man07] ) . The 

unique extremal graph is the Turdn graph T r _i(n), the complete (r — l)-partite graph of order n 
whose part sizes differ at most by 1. Thus, for fixed r, we have ex(n, K r ) = (1 — + o(l)) (™) • 
Rademacher (unpublished, 1941) proved that a graph with ex(n, K%) + 1 edges has at least 
[n/2\ triangles. This prompted Erdos [Erd55j to pose the more general problem: what is 
g r (m,n), the smallest number of -KT r -subgraphs in a graph with n vertices and m edges? 
Various results have been obtained by Goodman |Goo59j . Erdos [Erd621 IErd69| . Moon and 
Moser |MM62j . Bollobas |Bol76| . Fisher |Fis89j . Lovasz and Simonovits [LSTfJl IE583] . Razborov 
|Raz07llRaz08] . Nikiforov [Nikllj . Reiher [Reil2j . and others. 
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Let us consider the asymptotic question, that is, what is the limit 

t v dcf ar{[a( n 2 )\,n) 
9r{a) = hm 

for any given a and r? While it is not difficult to show that the limit exists, determining g r (a) 
is a much harder task that was accomplished only recently (for r = 3 by Razborov [Raz08j, for 
r = 4 by Nikiforov [Nik 1 1), and for r > 5 by Reiher jRei!2j ). 

The following construction gives the value of 93(a) (as well as g r (a) for every r > 4). Given 
a < 1, we choose t and c € | such that the complete (i + l)-partite graph of order n — > 00 
with i largest parts each of size (c + o(l))n has edge density a + o(l). Formally, let integer t > 1 
satisfy 
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r t + i 



(i) 



and let real 



_ t + y/t(t - a(t + 1)) 

t(t + l) {2) 



be the (unique) root of the quadratic equation 

^c 2 + tc(l-tc)^j = a (3) 

with c > jxj. Since a > 1 — j, it follows from ([3]) that c < j. Partition the vertex set 
[n] = {l,...,n} into t + 1 parts Vx,...,Vt+i with |Vi| = ••• = \Vt\ = [cn\ for i G [t]. Let 
G be obtained from the complete i-partite graph K(V\, . . . , Vt-i, U), where U = V% U Vt+i, by 
adding an arbitrary triangle-free graph on U with \V t \ |T4+l| edges. One possible choice is to 
take G[Z7] = K(Vt,Vt+i), resulting in G = K(Vi, . . . , Vt+i)- Since each edge of G[C7] belongs 
to exactly |Vi| + • • • + \ Vt-i\ triangles, the choice of G[U] has no effect on the triangle density. 
Clearly, the edge density of G is a + o(l). Thus 93(a) < /i(a), where 

def n I I t\ 3 2/ 



^ (a) = W (1_tc) J" (4) 

If a = 1, we let G be the complete graph K n and define h(l) = 1. For a £ [0, 1], let "H a ,n be the 
set of all possible graphs G on [n] that arise this way, Tl a = U ne N^ ajn , and ?^ = f U ae [ ,i]^a- 

If a = 1 — i for integer s > 1, then we have two choices for t but c = ^ and /i(a) = 6(g) /s 3 
are still uniquely defined. In general, % a)n has many non-isomorphic graphs and this seems to 
be one of the reasons why this extremal problem is so difficult. 

Although each of the papers [Raz08, Nikflj lRei!2| implies the lower bound 93(a) > h(a), it 
is not clear how to extract the structural information about extremal graphs from these proofs. 
Here we partially fill this gap by showing that, modulo changing a negligible proportion of 
adjacencies, the set H consists of all almost extremal graphs for the 93-problem. Here is the 
formal statement. 
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Theorem 1.1 For every e > there are 5 > and no such that every graph G with n > uq 
vertices and at most (93(a) + 5) (g) triangles, where a = e(G)/(^\, can be made isomorphic to 
some graph in T-L a ,n by changing at most eQ) adjacencies. 

This theorem is obtained by building upon the flag algebra approach from [Raz08j . In order 
to prove it we have to characterize first the set of extremal flag algebra homomorphisms for 
the 93-problem. This is done in Theorem 12.11 of Section [2j where the precise statement can be 
found. This task requires some extra work in addition to the arguments in [Raz08| and is an 
example of how flag algebra calculations may lead to structural results about graphs. (For some 
other results of a similar type, see e.g. |DHM+12l iHHK+lll IPiklll IPikl2| .) 



Our initial motivation was the following conjecture of Lovasz and Simonovits [LS76[ Con- 
jecture 1] for r = 3. 

Conjecture 1.2 For every r > 3 there is uq such that for every n > uq and m with < m < (2) 
at least one of g r (m,n)- extremal graphs is obtained from a complete partite graph by adding a 
triangle-free graph inside one part. 

If this conjecture is proved, then one may consider the problem of determining g r (m,n) 
as combinatorially solved: the number of -fT r -subgraphs in a such graph G is some explicit 
polynomial in m, n, and part sizes, and the question reduces to its minimization over the 
integers. This task may be difficult but it involves no graph theory. In fact, it is not hard to 
show (see e.g. [NiklH Section 3]) that the optimal part ratios are approximately as those of the 
graphs in rl a , where a = m/Q). (However, our rounding \V\\ = [cn\, etc., was rather arbitrary: 
it was chosen just to have the family T~L a well-defined.) 

We hope that Theorem 11.11 may help in proving Conjecture 11.21 in the same way as the so- 
called stability approach is useful in obtaining exact results. One example where this approach 
succeeded is the clique minimization problem in the special case when a = 1 — \ for some integer 
t > 2. First, the results of Goodman [Coo59] (for r = 3) and Moon and Moser [MM62J (for 
r > 4) imply that for any m, n we have 

. t(t-l)...(t-r + l) (n\r „ . . 

9r(m,n)>^ > ( — ) , ift>r-l, (5) 



r! \t 

where the real t is defined by m = (1 — l/t)n 2 /2. (Note that the Turan graph Tt(n) shows 
that ([5]) is asymptotically best possible when t is an integer; thus g r (l — |) = r!(*)/i r in this 
case.) While Moon and Moser |MM62j only indicate how to prove ([5]), a complete proof can be 
found in Lovasz and Simonovits [LS83, Theorem 1] who also deduced that all almost extremal 
graphs are close to T t (n) in the edit distance: 

Theorem 1.3 For every r and e > 0, there are 5 > and no such that, for any integer t > r — 1, 
every graph G with n > uq vertices, (1 — j ± S) Q) edges, and at most (g r (1 — -|) + 5) (™) copies 
of K r can be made isomorphic to T t (n) by changing at most ef!?) edges. 
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In fact, a sharper form of this result (with an explicit 5 = 5(r,t,e,n)) wad proved by 
Lovasz and Simonovits [ES83] who used it to establish Conjecture 11.21 when ex(n, K s ) <m< 
ex(n,K s ) + en 2 for some e = s(r,s) > 0. Unfortunately, we have not succeeded in proving the 
case r = 3 of Conjecture 11.21 so far. 

This paper is organized as follows. We outline the main ideas behind flag algebras and state 
some of the key inequalities from [Raz08] in Section (2) There, we also state our result on the 
structure of ^-extremal homomorphisms (Theorem l2.ip and show how this implies Theorem I l.li 
Section [3] contains a sketch of the proof from [Raz08 that 53(a) = h(a). Theorem 12. II is proved 
in Section [H 

2 Flag Algebras 

In order to understand this paper the reader should be familiar with the concepts introduced 
in [Raz07]. We do not see any reasonable way of making this paper self-contained, without 
making it quite long and repeating large passages from [Raz07j. Therefore, we restrict ourselves 
to sketching the proofs in [Raz07 , Raz08] , during which we informally illustrate the main ideas 
by providing some analogs from the discrete world. This serves two purposes: to state the key 
inequalities from [R az071 IRaz 08] that we need here and to provide some guiding intuition for 
the reader who is about to start reading [Raz07] . We stress that some flag algebra concepts do 
not have direct combinatorial analogs or require a plethora of constants to state them in terms 
of graphs. Here we just try to distill and present some motivational ideas. 

Many proofs in extremal graph theory proceed by considering possible densities of small 
subgraphs and deriving various inequalities between them. These calculations often become 
very cumbersome and difficult to keep track of "by hand", especially that the number of non- 
isomorphic graphs increases very quickly with the number of vertices. One of the motivations 
behind introducing flag algebras was to develop a framework where the mechanical book-keeping 
part of the work is relegated to a computer. 

So suppose that we have a graph G. Let n = \ V(G)\ be its order. 

The density of a graph F in G, denoted by p(F, G), is the probability that a random |V(.F)|- 
subset of V(G) spans a subgraph isomorphic to F. The quantities that we are interested in are 
finite linear combinations Xa=i a iP(^i^G), where Fi is a graph and a.i is a real constant. One 
can view a formal finite sum Y^t=i a i^i as a function that evaluates to Ylt=i a iP{^i-> G) on input 
G. Since we would like to operate with these objects on computers, we try to keep redundancies 
to minimum. In particular, the graphs Fi are unlabeled and pairwise non-isomorphic. Let F° 
consist of all (unlabeled non-isomorphic) graphs and let KLF be the vector space that has J-" 
for a basis. (The meaning of the superscript will be explained a bit later.) 

There are some relations which are identically true when it comes to evaluations on input 
G: for example if n > £ > \V(F)\ for some graph F and we know the densities of all subgraphs 
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on I vertices, then the density of F can be easily determined: 

p(F,G)= P(F,F) P (F,G), 

where F^ C F° consists of all graphs with exactly I vertices. 

So it makes sense to factor over JC°, the subspace of RF° generated by F 
over all choices of F and I > |V(F)|. Let 

A d ^ RF°/1C . 

By ([6]), any element of A can still be identified with an evaluation on (sufficiently large) graphs. 
A theorem of Whitney [Whi32] Theorem 5a] implies that a linear combination X^S=i a iFi that 
always evaluates to is necessarily in JC° (see the discussion in [HNlll Page 551]). 

Let some Fi G for i = 1, 2 be fixed. The product p(F%, G)p(F 2 , G) is the probability that 
two random subsets Ui,U 2 C V(G) of sizes t\ and ^2, drawn independently, induce copies of 
F\ and F2 respectively. With probability 1 — 0(l/n) (recall that n = \V(G)\), the sets U\ and 
U 2 are disjoint. Let us condition on this event. The conditional distribution can be generated 
as follows: first pick a random (l\ + ^-set U and then take a random partition U = U\ U 17% 
with \Ui\ = ti. For F G J^ 1+i2 , let p(F 1 ,F 2 ;F) denote the probability that G^] ^ F f (i.e. 
the subgraph of G induced by E7j is isomorphic to i^) for both i = 1,2 when conditioned on 
G[C7] = F. This probability depends on Fi, F2, and F only. Thus 

p(F 1 ,G)p(F 2 ,G)= p(F 1 ,F 2 ;F)p(F,G)+0(l/n). (7) 

Since we are interested in the case when n — > 00, we formally define the product F\ ■ F 2 to be 
equal to J^FeT p(^i, F 2 ; F) F G MF and extend this mutiplication to ILF by linearity. 

e l+ e 2 

It is not surprising that this definition is compatible with the factorization by making A 
into a commutative associate algebra with the empty graph being the multiplicative identity, 
see Ra/07, Lemma 2.4]. 

Unfortunately, we do not have the property that graph evaluations preserve multiplication 
exactly. This can be fixed if we take as input not just a single graph G but a sequence of 
graphs {G n } which is convergent by which we mean that |V(Gi)| < |V(G2)| < • • • (we call such 
sequences increasing) and for every graph F the limit 

0(F) d ^ f lim p(F,G n ) (8) 

n— >oo 

exists. We extend <fi by linearity to BLF°. It is routine to check that <p is compatible with the 
factorization by KP and, in fact, gives an algebra homomorphism from ^4° to R (which we still 
denote by (ft), see [Raz07l Theorem 3.3]. We say that <p is the limit of {G n } and, following the 
notation in [Raz07, Section 3.1], denote this as (j) = h m n->ooP ■ 



(6) 

-Ef^op(F,F)F, 
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Clearly, (j) is non-negative, that is, (j)(F) > for every graph -F. Let Hom + (^4°, R) be the 
set of all non-negative homomorphisms. 

It turns out that every non-negative homomorphism : .4° — > R is the limit of some sequence 
of graphs. It is instructive to sketch a proof of this, see Lovasz and Szegedy [LS06J Lemma 2.4] 
(or [Raz071 Theorem 3.3]) for details. Take some integer n. Since the identity X^FeJ 70 = 1 
holds in A , we have that X^FeJ 70 ^(^ ? ) = 1> that is, <p defines some probability distribution 
on Let G n t f, G J-® be drawn according to this distribution with the choices for different 
values of n being independent. Fix some F and e > 0. Let n > \V(F)\. Easy calculations show 
that the expectation of p(F, G n ^) is exactly 4>{F). Also, the variance of p(F, G n ^), which can 
be expressed via counting pairs of F-subgraphs versus two independent copies of F, is 0(l/n). 
Chebyshev's inequality implies that the probability of the "bad" event \p(F, G n ^) — <f>{F) \ > e 
is 0(1/ n) and the Borel-Cantelli Lemma shows that with probability 1 only finitely many bad 
events occur when n runs over, for example, all squares. Since there are only countably many 
choices of F and, for example, e G {1, |, |, . . . }, we conclude that {G n 2^} converges to (f> with 
probability 1. Thus the required convergent sequence exists. 

If one wishes that the graph orders in the sequence span all natural numbers, one can 
pick some convergent sequence and fill all orders by uniformly "blowing" up its members, see 
e.g. jHHK+lll Section 2.2]. 

How can these concepts be useful for proving that 93(a) = /i(a)? Pick an increasing sequence 
of graphs {G n } of edge density a + o(l) such that the limit of p(K%, G n ) exists and is equal to 
93(a). A standard diagonalization argument shows that {G n } has a convergent subsequence; 
let (f> be its limit. Then <p(Ki) = a. Now, if we show that 

V0 G Hom+(„4°,R) ((j)(K 2 )=a =^ (f)(K 3 ) > h(a)) , (9) 

then we can conclude that indeed (73(a) = h(a), as it was done by [Raz08] . 

In this paper, we achieve more: we describe the set of all extremal homomorphisms, that is, 
those cf> G Hom + („4°,R) that achieve equality 4>{K%) = 93 ((^(i^))- 

Let C Hom + („4°,R) consist of all possible limits of convergent sequences {G n }, where 
G n G %a for some a G [0, 1] and all n. Equivalently, $ can be defined as follows. Recall that the 
join G\ V . . . V Gk of graphs G±, . . . , G^ is obtained by taking their disjoint union and adding all 
edges in between. We define a similar operation on homomorphisms (f>i, . . . , 0^ G Hom + (^4°, R). 
We need a more general construction where one specifies how much relative weight each <f>i has, 
by giving non-negative reals oti,...,otk with sum 1. Let n — > 00 and, for i G [k], let Gi >n be 
a graph with [ain\ vertices such that the sequence (Gj jn ) converges to as we have already 
remarked, it exists. Let F n = Gi, n V • • • V Gk, n - Let the join <f> = V(<^i, . . . , cftk] ai, . . . , a^) be 
the limit of {F n } (the limit clearly exists). 

Alternatively, we can define the join (ft without appealing to convergence. To this end, it is 
enough to define the density of each graph F G F°, and we do it as follows. Let aut(F) denote 
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the number of automorphisms of F. Let 

mejff E UHv 4 |)n(-i KI *ra)). 

v 7 (Vi,...,v fc ) Vl ih 1 hl/ i=i 

where the summation runs over all possible ways (up to isomorphism) to partition V(F) = 
Vi U • • • U Vk into k labeled parts (allowing empty parts) so that the induced bipartite subgraph 
F[Vi, Vj] is complete for all 1 < % < j < k. The reader is welcome to formally check that the join 
is well-defined (with respect to the factorization by JC°) and belongs to Hom + („4°, IR). (These 
facts are obvious from the first definition.) Now, <3? is exactly the set of all possible joins 

v(0 1 _^_ 1 0, V; c^^c, 1 - (t - l)c), 

t— 1 times t—1 times 

where denote the (unique) non-negative homomorphism of edge-density 0, l/(t+l) < c < 1/t, 
and ip is an arbitrary non-negative homomorphism with ^p(K^) = and ip{K2) = 2c(l — tc)/(l — 
(i-l)c) 2 . 

Our next result states that the set of ^-extremal homomorphisms is exactly $. 
Theorem 2.1 

$ = {0 € Hom + („4°,IR) : <f>(K 3 ) = 93(^2))} • 

Let us show that Theorem 12.11 implies Theorem 11.11 The shortest way is to refer to some 
known results about the so-called cut-distance 5n that goes back to Frieze and Kannan [FK99J. 
We omit the definition of 5n but refer the reader to BCL + 08l Definition 2.2]. 



Suppose for the sake of contradiction that Theorem 11.11 is false, which is witnessed by 
some e > 0. Then we can find an increasing sequence {G n } of graphs with p{K^,G n ) < 
gz(p(K2, G n )) + o(l) that violates the conclusion of Theorem ll.il By passing to a subsequence, 
we can assume that {G n } is convergent. Let </>o G Hom + (^l ,M) be its limit. Let a = 0o(A"2)- 
By ([9|), we know that ^0(^3) = h(a). By Theorem 12.11 4>o £ & and we can choose a sequence 
{H n } in % which converges to cj>Q with V(H n ) = V(G n ). 

This convergence means that G n and H n have asymptotically same statistics of fixed sub- 
graphs. This does not necessarily implies that G n and H n are close in the edit distance. (For 
example, two typical random graphs of edge density 1/2 have similar subgraph statistics but 
are far in the edit distance.) However, the presence of a spanning complete partite graph in H n 
implies a similar conclusion about G n as follows. 

Theorem 2.7 in Borgs et al jBCL+08] gives that 5n(G n , H n ) = o(l), that is, the cut-distance 
between G n and H n tends to 0. (An important property of the cut-distance is that an increasing 
sequence {G n } is convergent if and only if it is Cauchy with respect to 5n-) 

By [BCL+081 Theorem 2.3], we can relabel V(H n ) so that for every disjoint 5, T C V(G n ) 
we have 

\e(G n [S,T]) - e(H n [S,T])\ = o(v 2 ), (11) 
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where v = v(n) is the number of vertices in G n . Informally, this means that the graphs 
G n and H n have almost the same edge distribution with respect to cuts. Take the partition 
V(H n ) = V\ U ■ ■ ■ U Vt-i U U that was used to define H n . Let i £ [t — 1]. If we set S = V% and 
T = V(G n ) \ Vi in then we conclude that the number of S — T edges that are missing from 
G n is o(v 2 ). Also, the size of G[Vi] is o(v 2 ) for otherwise a random partition Vi = S U T would 
contradict (fTT|) . Thus, by changing o(v 2 ) adjacencies in G n , we can assume that the graphs G n 
and H n coincide except for the subgraph induced by U. Suppose that \U\ = fl(n) for otherwise 
we are easily done. We have 

\e(G n [U]) - e(H n [U})\ = \e(G n ) - e{H n )\ = o(v 2 ). 

Of course, when we modify o(v 2 ) adjacencies in G n , then the number of triangles changes by 
o(v 3 ). Each edge of G n [U] (and of H n [U]) is in the same number of triangles with the third 
vertex belonging to V(G n ) \ U. Since H n [U] is triangle-free and G n is asymptotically extremal, 
we conclude that G n [U] spans o(v 3 ) triangles. By the Removal Lemma [RS781 EFR86J (see 
e.g. [KS96, Theorem 2.9]), we can make G n [C7] triangle-free by deleting o(v 2 ) edges. 

If e(G n [U]) > e(H n [U]), then we just remove some edges from G n [U] until exactly e(H n [U]) 
are left, in which case the obtained graph G n belongs to T-L a ,n and Theorem 11.11 is proved. 
Otherwise we obtain the same conclusion for all large n by applying the following lemma to 
G n [U] and s = e(H n [U}). 

Lemma 2.2 For every e > there are 5 > and uq such that for every K^-free graph G on 
n > riQ vertices and every integer s with 

e(G) < s < min(e(G) + 5n 2 , [n 2 /A\) (12) 

one can change at most en 2 adjacencies in G so that the new graph is still K^-free and has 
exactly s edges. 

Proof. Clearly, it is enough to show how to ensure at least s edges in the final i^-free graph. 
Given e > 0, choose small positive c ^> 5. Let n be large and let s satisfy (fT2|) . Let m = e(G). 

We can assume that, for example, m > en 2 /3. Also, assume that m < [n 2 /A\ — cn 2 for 
otherwise we are done by by the Stability Theorem of Erdos [Erd67] and Simonovits [Sim68] 
which implies that G can be transformed into the Turan graph T2(n) by changing at most en 2 
adjacencies. 

The number p of paths of length 2 in G is Ylx^v{G) C2 ) which is at least n( 2 ™ // ") by the 
convexity of the function (2). By averaging, there is an edge xy € E(G) that belongs to at least 

2p 2n( 2m / n ) 4m r 
— > — - — - — - > 5n 

m m n 

such paths (which is just the number of edges between the set {x, y} and its complement). 
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Let G' be obtained from G by adding cn clones of x and cn clones of y. Thus G' has 
n' = (1 + 2c)n vertices and m! > m + cn(^p — 5n) + (cn) 2 edges. If we take a random n-subset 
{/ of V(G'), then each edge of G' is included with probability Q) /Q) • Thus there is a choice 
of an n-set U such that the number of edges in H = G'[U] is at least the average, which in turn 
is at least 

(m + cn(^ - 5n) + (cn) 2 ) g) c 2 (n 2 - Am) - 2c5n 2 

((l+2c)n\ " m+ (1 + 2c) 2 ' 

This is at least m + <5n 2 > s by our assumption on m. Since G and H coincide on the set 
V(G) n V(H) of least n — 2cn vertices, G can be transformed into the i^-free graph H by 
changing at most 2cn 2 < en 2 adjacencies, as required. ■ 



3 Sketch of Proof of (f)(K 3 ) > h((j>(K 2 )) 

Let us sketch the proof of Q from [Raz07, Raz08j, being consistent with the notation defined 
there. Consider the "defect" functional f(<p) = 4>(K^) — h((/>(K2)), where h is defined by 
We can identify each homomorphism <j) £ Hom(^4°,lR) with the sequence 

of its values on graphs. Let us equip all products with the pointwise convergence (or product) 
topology. The set Hom(^4°,lR) is a closed subset of M.^ as the intersection of closed subsets 
corresponding to the relations that an algebra homomorphism has to satisfy. Thus the set 

Hom + (^°,M)= p| {(f) € Hom(^°,lR) : <f){F) > 0} 
FEJ* 

is closed too. Moreover, it lies inside the compact space [0, 1]^ , so it is compact as well. Since 
h(x) is a continuous function (including the special point x = 1), our functional / is also con- 
tinuous and assumes its smallest value on Hom + (.A ,1R) at some non- negative homomorphism 
4>o. Fix one such cpo for the rest of the proof. Let a = 4>o(K2). Let t = t(a) and c = c(a) be 
defined as in the Introduction. Let b = ^(Ks). We have to show that b > h(a). 
If a = (f)(K2) < 1/2, then h(a) = and there is nothing to do. 

Let us write an explicit formula for the function h(x) defined in when 1 — j < x < 1 — jj^'- 



, , , dcf (t - m - 2yjt{t ~ X(t + l)))(t + y/t(t - X(t + l))) 2 

ht{x) = ¥{t + Tf • (13) 

If a = 1 — j or a = 1 - ^rj, then we are done by Goodman's bound (that is, the case r = 3 
of ©). So let us assume that a lies in the open interval (1 — j, 1 — jtj). Here the function ht(x) 
is differentiable and it is routine to see that h' t (a) = 3(t — l)c. A calculation- free intuition is 
that if we add one edge to H € Ha then the number of triangles increases by ((i — l)c + o(l))n 
(while the effect of the change in the part sizes is relatively negligible); so we expect that 

^(^^(t-iMS)" 1 . 
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Let us see which properties cj>o has. Let {G n } converge to </>o with |V"(G n )| = n. Let e > 
be fixed. 

It is impossible that at least en 2 edges of G n are each in more than ((i— l)c + e)n triangles: 
by removing a uniformly spread subset of these edges we get a change that is noticeable in 
the limit and strictly decreases the defect functional /. Thus, if we pick a random edge from 
E(G n ), then with probability 1 — o(l) there are at most ({t — l)c + o(l))n triangles containing 
this edge. (Note that G n has Q(n 2 ) edges by our assumption a > 1/2.) The corresponding flag 
algebra statement }Raz08l (3.3)] reads 

4> E (K E ) < -h' t (a) a.e. (=almost everywhere). (14) 

Let us informally explain (|14p . It involves counting triangles that contain a specified edge. 
Let J- E consist of E-flags, by which we mean graphs with some two adjacent vertices being 
labeled as 1 and 2. Any isomorphism has to preserve the labels. We may represent elements 
of T E as (G;xi,X2), where G G F° is a graph and xi G V(G) is the vertex that gets label 
i. Suppose that we wish to keep track of various subgraph densities and their finite linear 
combinations for E-flags. We can view (F;yi,y2) £ T E as an evaluation on T E that on input 
(G; xi,X2) returns p((F; 2/1,2/2), (G; x\, X2)), the probability that the E-subflag of G induced by 
a random |V(E)|-set X with {x\,X2} Q X C V(G) is isomorphic to (E; 3/1,2/2)- 

Again, if we know the densities of all E-flags with t > \F\ vertices, then we can determine the 
density of (F; 2/1,2/2) by the analog of ©. So we can define the corresponding linear subspace 
IC E and let A E = WF E /K E . The obvious analog of (|7|) holds, and the corresponding coefficients 
define a multiplication on MJ- E that turns A E into a commutative algebra. The multiplicative 
identity is E E T E ', the unique -E-flag on K2. As in the unlabeled case, the limits of convergent 
sequences of E-flags are precisely non-negative algebra homomorphisms from A E to the reals 
f |Raz07l Theorem 3.3]). 

Now, we can turn G n into an E-flag by taking a random edge uniformly from E(G n ) and 
randomly labeling its endpoints by 1 and 2. Thus for each n we have a probability distribution 
on E-flags which weakly converges to the (unique) distribution on Hom + (^l" E , M), see [Raz08, 
Section 3.2]. In (|14|) . cf> E denotes the extension of 0o (that is, a random homomorphism drawn 
according to this distribution) while K E is the unique E-flag with the underlying graph being 
K 3 . 

Let us consider the effect of removing a vertex x from G n . When we first remove d(x) 
edges at x, the edge density goes down by d(x)/ u) • Next, when we remove the (now isolated) 
vertex x, the edge density is multiplied by (^/(^ ) = ^ + n + 0( n_2 )- Thus the edge 
density changes by — d(x)/(^j + 2a/n + 0(re~ 2 ). Likewise, the triangle density changes by 
— K\(x)/ (g) + 3b/n + 0(n~ 2 ), where K\{x) is the number of triangles per x. Thus for all but 
at most en vertices x we have (—2d(x)/n + 2a)h' t (a) < — 3K^(x)/ (T) + 36 + e, for otherwise 
by removing such vertices (and taking the limit as n — > 00) we can strictly decrease the defect 
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functional /. In the flag algebra language this reads 

- 2ti t (a)cf ) 1 (K 1 2 ) + 2ti t (a)a < -tyl^s) + ^ a - e -> ( 15 ) 

where T 1 consists of all graphs with one vertex labeled 1, K^K^ £ J-" 1 "evaluate" the edge 
and triangle density at the labeled vertex, and 4>q £ Hom + («4 , R) is the random extension of 

4>0- 

Note that if we take the expectation of each side of (|15|) with respect to the random 0q € 
Hom + (A 1 , R) , then we get 0. (A calculation- free intuition is that the edge/triangle density of 
a graph G is equal to the average density of edges/triangles sitting on a random vertex of G.) 
Thus we conclude that (fT5|) is in fact equality a.e. ( [Raz08l (3.2)]). 

How can (fT4"j) and (fT5"j) be converted into statements about 0o? If, for example, one applies 
the averaging operator ( |Raz071 Section 2.2]) to (fT5|) . that is, taking the expected value 

of (|15p over c/>q, then one obtains the identity = 0, as we have just mentioned. However, one 
can multiply both sides of (|15p by some 1-flag F and then average. (In terms of graphs this 
corresponds to weighting vertices of G n proportionally to the density of F-subgraphs rooted at 
them.) What sufficed in [Raz07, Raz08] was to take F = K\. Denoting e = K\ for convenience 
and rearranging terms, we get ( [Raz08l (3.4)]): 

MHeKlh - 2ti t (a)le%) = a(3b - 2ati t (a)). (16) 

Applying the operator [. . .}e (averaging over 0j^) directly to (fl~4"|) is not useful. Namely, if 
we take a graph G € H a , then the graph analog of (fl~4"|) may have slack for edges that connect 
two larger parts; thus the obtained inequality will not be best possible. The trick in [Raz07 
was first to multiply (|14p by the -E-flag P% whose graph is the complement of the 3-vertex path. 
(Thus each edge of % a with slack gets weight 0.) We obtain ( [Raz08l (3.5)]): 

MIP3K3IE) < \h»U\Pi\E) = l -h' t (a)Mh). (17) 

We will also need the following identity which may be routinely checked (compare with 
[Raz08l Lemma 3.2]): 

3[eir!] ! + Z{PiKi\ E = 2K 3 + K 4 + ± K h3 , (18) 

where K s ^ is the complete bipartite graph with part sizes s and t. (Thus is a triangle plus 
an isolated vertex.) Also, we have 

±P 3 + 2le% = K 2 + K 3 . (19) 



Now, if we apply <pQ to (|T8j) and p9|) and combine with (|T6j) and (|T7|) . then we obtain the 
following inequality (cf. |Raz081 (3.6)]): 

, . a(2a-l)h>(a) + MK4) + lMKi,3) 

~ h' t (a)+3a-2 ' {M) 



11 



If 0o (-^1,3) = an d 0o {K&) is equal to the limiting ^-density in H a , then the right-hand side 
of (|20p is exactly h(a). Thus it remains to bound 0o(-^4) from below. In particular, we are 
already done if a < 2/3 since every graph in T~L a has no (or very few) copies of K4; this is what 
was done in [R az07] . Of course, the result of Nikiforov [Nikll] who determined 54(a) for all a 
would suffice here but in order to prove our new Theorem 12. II we need to analyze the argument 
of |Raz08j further. 

Following [Raz08l Page 612] define 

A = |/4(a) = 2(t-l)c, 

B = f Aa-b = -ah' t (a) - b. 

3 

Then, for example, (|15p . which is an equality a.e., can be rewritten as 

cj>l(Kl) = A<f>l(e) - B a.e. (21) 

Also, let us apply the averaging operator [. . .\e,i to (fl^]). Informally speaking, given the 
labeled vertex x\ £ V(G n ), we pick the second labeled vertex x<i uniformly at random and take 
the expectation of (|14|) multiplied by the indicator function of x\ and x<i being adjacent. Since 
{Ki\ E ^ = K\ and {Ej EA = e, we get ( [Ra^OSl (3.8)]) 

4>l{Kl) < h'(a) 0j(e) = - 05(e) a.e. (22) 
From (Ell) and (El) we obtain 



0<|<<^(e)<^ a.e. (23) 

Now let us take any individual (j) 1 £ Hom + (^4 1 ,lR) for which (|2ip - (|23p hold. Let 

$ d 4 f 0V G Hom + (^°,R), (24) 

see [Raz081 Page 612]. Informally, we take an arbitrary vertex x for G n and assume that the 
density of edges/triangles containing x satisfies (j2~Tj) - ([2"3j) . Then ip correspond to taking the 
subgraph H n of G n induced by the neighborhood of x. For example, the edge density of H n can 
be calculated by taking the triangle density at x and multiplying it by ("j^VCa ) ~ (^J}) 2- 
In the flag algebra formalism this reads ([Raz08, (3.13)]) 

where following [Raz08, Page 612] we define 

z = (l) 1 (e)/A and /j = B/A 2 . (26) 
Some calculations based on Goodman's bound © show that ( |Raz08t (3.15)]) 

if>(K 2 ) (27) 
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Summarizing (in the graph theory language): the degree of a typical x € V{G n ) determines 
the edge density of G n [N(x)}, the subgraph induced by the neighborhood N(x) of x. Moreover, 
this density is below 1 — \ + o(l). This give us a strategy for bounding the number of K^s in 
G n from below: use induction on t to bound the number of K^s in N(x) and then sum this over 
all x € V(G n ) (and divide by 4). Unfortunately, this bound on ip(K^) involves radicals and it is 
not clear how to average it. Another complication is that t(ip(K2)) may assume different values 
for different choices of (f) 1 . Both of these difficulties are overcome by proving the following lower 
bound on <j) l {K\) = V'(-f^3)(<?!> 1 (e)) 3 which is a linear function of </> 1 (e) that does not depend on 
t{i){K 2 )) f [Ra^08l (3.24)]): 

,HKl) > A> (5(1 - 2„) - + . (^) 

where, for 1 < s < t — 1, r) s is the unique root of the equation 

^ = 1-- (29) 

that lies in the interval [fi, 2/j], see |Raz081 (3.17)]. Thus the random extension 0q satisfies (|28p 
a.e. and we can average it, obtaining a lower bound on 0o(-R'4), which is Inequality (3.25) in 
[Raz08]. (Note that the expectation of cf>^(K\) is ^{K^).) It turns out that this lower bound, 
when substituted into (|20|) suffices for proving the desired conclusion b > h{a). The derivations 
(also those of (|28p ) are rather messy and do not involve any genuine flag algebras calculations. 
So we omit them and refer the reader to [Raz08] for all details. 



4 Proof of Theorem 12.11 

All notation here is compatible with that of [Raz07t lRaz08j. We exclusively work with the 
theory of graphs. As before, let 0, 1, and E denote the (unique) types with respectively 0, 1 
and 2 (adjacent) vertices. Also, p d = K 2 € J 7 ^ an d e = f K\ G T\ are the (unique) 0- and 1-flags 
having two adjacent vertices. 

We offer a Mathematica code that verifies some laborious flag algebra (in) equalities from 
this paper at this URL: http://homepages.warwick.ac.uk/~maskat/Papers/K3.nb 

Let $ C Hom + („4°,R) be the set of the conjectured extremal homomorphisms defined in 
Section[2j Let c/>o £ Hom + („4°,lR) be arbitrary such that (j)Q(Ks) = /i(0o(p))- We have to show 
that (/>o € < J?. Let a d = 4>o(p) and b = (po(K^). 

We prove Theorem 12.11 (that is, the claim that </>o € <&) by induction on t = t(a) (cf. ([T])). 
If t = 1, then a < 1/2 (e.g. by Goodman's bound ([S])), 6 = 0, and there is nothing to do: every 
non-negative homomorphism of triangle density is in $ by definition. Let t > 2 and assume 
that we have proved the theorem for all smaller t. 

Suppose first that a = 1 — ^ for some integer s. Apply Theorem 11.31 to any sequence {G n } 
convergent to </>o, say with |U(G n )| = n, to conclude that G n is o(ra 2 )-close to the Turan graph 
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T s (n) in the edit distance. Clearly, when we change o(n 2 ) edges in G n , then the density of any 
fixed graph F changes by o(l) so (fro is still the limit of {G n }. Since the limit of {T s (n)} is in 
we are done in this case. 

So let a lie in the open interval (1 — j, 1 — jfj). Let c be defined by (j3]). We assume that 
the reader is familiar with the proof in [Raz08] (that we sketched in Section [3|) and we use 
the facts established there. In particular, we know that b = h(a). This gives some noticeable 
simplifications: 

B = t(t-l)c 2 , 

B t 

» = # = W^Ty (30) 

TH-i = 1/2- 

Since any type a is simply a labeled graph |Raz071 Section 2], after unlabeling its vertices, a 
can be viewed as an element of J-" . If (j>o(cr) > 0, then the random extension cp^ G Hom + (A rr , R) 
is well-defined ( |Raz08[ Section 3.2]). Let S a ((j)o) denote its support, that is, the smallest closed 
subset of Hom + (^l ",R) of measure 1. A useful property of the support is that if some closed 
property has measure 1, then every element of S a ((fio) has this property. We fix an arbitrary 
4> l G S ,1 (0o). Inequalities (|2"T ]) - (j2"3"]) hold a.e., thus (ft 1 satisfies them. In particular, ([23]) simplifies 
to 

0< -| <4> X {e) <tc< 1. (31) 

So, we can define ip by (f24j) . 

Let us prove that ip is extremal (that is, has the smallest possible triangle density given its 
edge density). 



Claim 4.1 £ $ and ijj(p) 6 



J- I- I 

t-ii 1 t 



Proof. Let s be such that ip(p) G (1 — |, 1 — jxj]. 

We know that |Raz081 (3.25)] is equality for every 1 G 5 1 (^o)- Since it is obtained by 
averaging }Raz08l (3.24)] (our formula ([28])). the latter is also equality. The analysis of the 
calculations in |Raz08| shows that |Raz081 (3.16)] (which is equivalent to tp(K 3 ) > h s (ip(p))) is 
also equality. Thus the homomorphism ip G Hom + (^l ,R) is extremal. By (|27p we have that 
s < t — 1. The (global) induction assumption implies that ip G 

We still have to show the second part of the claim when t > 3. Recall that ifj(p) = ^r- 
by ([25]) . In view of ([30]). the quadratic equation = 1 — ^-L- has two roots: z = \ and 
z = 2(t-2) 1 (EB)' ^ ^ s impossible that z > 2 (t-2) ( w hich is equivalent to 1 (e) > ^j^-c). 
Thus, if we assume that s < t — 2, then ip(p) < 1 — r^- and z < 1/2 = ?7t_i. 

Thus, when we apply the proof of |Raz08l Claim 3.3], the case z < rjt-i takes place. This 
implies that [Raz08l (3.21)] is tight. Then |Raz08l (3.23)] is also tight. Its proof on Page 615 
of |Raz08] shows that this is possible only if p = ^jj- is the largest element of [§,^j-], the 
admissible interval for p. By ([30]) we have that s = t — 1, as required. ■ 
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The intuition behind the following claim is as follows. Identity (|2ip gives a linear relation 
between triangle and edge densities via a vertex. By Claim 14.11 we know that (|2ip also holds 
for the subgraph induced by the neighborhood of almost every vertex x G V(G). If we average 
this for all choices of x, then we get some linear relation between the densities of K4, K3, and 
K2 that has to hold for all extremal homomorphisms. Repeating we get a linear relation for 
K§, K4, and K3, and so on. 

Claim 4.2 For every r > 3, we have 

MK r ) = 2{t-r + 2)cMK r -i) - (t - r + S)(t -r + 2)c 2 fo{K T _ 2 ). (32) 

Proof. We use induction on r. If r = 3, then the identity relates b = <Pq{K^) and a = (f>o(p)- 
Both of these parameters have been explicitly expressed in terms of c and t and the desired 
identity ([32]) can be routinely checked. 

Suppose that (|3"2"j) is true (for all extremal cfto). Let us prove it for r + 1. Let (ft 1 € S 1 ^) be 
arbitrary and let ip = (f?"K e . By Claim |4.1l we know that ib(p) € [1 — — 7]. Let 7 = c(ip(p)), 
where c{x) is defined by ((3|), that is, 7 is the unique root of 

2 (C 2 + {t ~ 1)7(1 ~ {t ~ 1)7) ) = ^ (33) 

with 7 > 1/t. We have that 7 = c/4> l (e). Indeed, this value satisfies (f33|) by (p5|) and is at 
least 1/t by f)31 1) . (An informal reason is that all derived inequalities are sharp for $ and, if we 
pass to a neighborhood of a vertex in some H G "H a , then its £ — 2 largest parts have the same 
(absolute) sizes as the t — \ largest parts of H.) 

By Claim I4TTI we have that t(ip(p)) = t — 1. Thus, by the induction assumption, 

%l>(K r ) = 2{t-r + 1)7^(^-1) - (t - r + 2)(t - r + l) 7 2 ^(K r _2). 

If we now substitute 7 = c/4> l {e) and ip(K s ) = 4> l (Kg +1 ) / \4> l {e)) s , cancel all occurences of 
(0 1 (e))~ r , and average the result, we obtain exactly what we needed. ■ 

Let us define /i( r )(l) = and, for < x < 1, 

fc«(x)^rl((*)^+( p * 1 )^(l-tc)) > 

where c = c(x) is defined by ([3]). In other words, /i( r )(:c) is the limiting density of K r in the 
graphs from T-L x ,n as n — )• 00. It is an upper bound on g r (x) and, as it was recently shown by 
Reiher [Reil2j . we have in fact equality. 

Claim I4T21 has the following useful corollary. 

Claim 4.3 Let r > 3. Then <po(K r ) = h^ r \a), that is, the density of each clique is determined. 
In particular, 4>o(K s ) = for s > t + 2. 
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Proof. This is true for r = 3 as 4>q{K^) = 53(a). The general case follows from Claim B~2l by 
induction onr.i 

The following lemma verifies Theorem l2.1l if 0o is the limit of complete partite graphs. Note 
the special role of cq: it corresponds to the proportion of vertices that are in parts of relative 
size o(l). 

Lemma 4.4 Let < a < 1. Let reals c\ > ■ ■ ■ > c s > and cq > satisfy X^i=o c « = 1 an< ^ 
c o/ 2 + J2o<i<j< s c i c 3 = a / 2 - Then 

Cn Cn . . v-^ h(a) 

0<i<j<h<s 

with equality if and only if cq = 0, s = t + I, and c\ = ■ ■ ■ = ct = c, where c and t are defined 
by (EP and (0). 



Proof. The claim is routine to establish, see the calculations in [Niklll Section 3].« 

Suppose first that ^0(^3) = 0, where P3 denotes the complement of the 3- vertex path. 
Take a sequence {G n } convergent to ^0 . By the Induced Removal Lemma of Alon, Fischer, 
Krivelevich, and Szegedy Al'KSOO , we can change o(\V(G n )\ 2 ) adjacencies in G n and ensure 
that G n has no ^-subgraph. As it is well-known, a graph without an induced copy of P3 is 
complete multipartite. Here, Theorem 12.11 follows from Lemma 14.41 

So suppose that ^0(^2) > 0. We need a few auxiliary results in this case. For a graph 
F 6 J^, let F^ 1 ' G .FLj be the 1-flag obtained by adding a new vertex x that is connected to 
all vertices of F (i.e., taking the join F V K\) and labeling x as 1. 

Claim 4.5 0o([P 3 (1) Ji) > 0. 

Proof. By Claim 03] we have that (t>o(K±) = /i (4) (a). When we substitute this value into 
(|20p we obtain the tight inequality except for the extra term involving K\% (a triangle plus an 
isolated vertex). We conclude that 

^0(^1,3) = 0. (34) 

Inequality (fT7|) is also used in the proof, so it has to be tight. Since we assumed that 4>q{Pz) > 0, 
we have that (t'oilP^ K^]e) > 0, where is the unique E-Hag on P3. But 

iP,Ki} E = \K^+ l -ip£X, 

and the claim follows. ■ 

The two graphs in Figure [TJ called G\ and G2, will play a special role. 

Claim 4.6 <j> ( G l) = MG2) = 0. 
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Figure 1: Exceptional graphs 




P 



E.c 



jE,b 



F 



E 



Figure 2: Some -E-flags 



Proof. We apply the same strategy (although with much more involved calculations) as 
the one used to prove (|34p . Namely, we make up an analog of (|20p that is tight on extremal 
homomorphisms and such that the "overall slackness" involved will cover G\ and Gi- 
Form the element f E £ T E as follows: 

-E def lpB,c _ }_pE,b _ pE 

J 2 4 2 4 ' 
where P?' c ,P?' b ,F E € are shown on Figure [2J Since (fT7|) is tight, 

0to ) < =► <t>%(P 3 E ) = O a .e. 

and, since both P^' 6 and F E contain P E , this implies that 

^{Ki)< l -h' t {a) =► ^(/ B )>0,e. (35) 

(Recall that /i£ is just the restriction of /i to the interval [1 — j, 1 — |xj] as defined by (|13p .) 
Thus, by (O, we can multiply the left-hand side of ([35]) by f E , obtaining a true inequality. If 
we apply the averaging operator [. . to this new inequality, we get that 

Ulf E Ki\ E ) < \ti t (a)M\f E }E). (36) 

Next, similarly to [B.az081 (3.4)] but multiplying [Raz08, (3.2)] ( i.e. our formula (I15p which is 
equality) by K\ rather than by e, we obtain 

MH(Kl)% - 2h' t {a)leKl\ x ) = b(3b - 2ah' t (a)). (37) 

Subtracting (|37l) from (j36j) multiplied by 3, and re-grouping terms, 

3MU E K E h - KK^fh) + h'^UneKMi ~ U E }e) < b(2ati t (a) - 36). (38) 
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But we have 

2[eK 3 1 l 1 - \f E \ E = ±K 3 + \k a - ±K l>3 (39) 

and 

\f E Ki\ E - \{Kl)% >^{Gi + G 2 ) - Qtf 4 + \ P K, + ±tf 6 ) • (40) 
Substituting these relations into ([38j) . and using Claim H~3l we conclude by (f34"|) that 

^MGi + G 2 ) < b(2ah' t (a)-3b)-h , t (a)(^b + ^h^(a)\ 

+ (lh^(a) + ab+^h^(a)\ = 0. 

Claim H~6l is proved. ■ 

Lemma 4.7 Lei G be a graph on V = {x\, x 2 , x 3 , y, z] as follows. Let the vertices xi,x 2 ,X3 
induce P3 with x\x 2 G E(G). Let y be adjacent to each Xi and let z be non-adjacent to at least 
one Xi. If yz G" E{G), then G contains as an induced subgraph or G is isomorphic to G\ 
or G 2 . 

Proof. If zxi,zx 2 G E(G), then zx% G" E(G) and G — y = K\^. If zxi,zx 2 G" E(G), then 
G — X3 = K\^. So we can assume without loss of generality that zx\ G E{G) and zx 2 G" E(G). 
Now, if ZX3 G" E(G), then G is isomorphic to G\\ otherwise G = G 2 .m 

Now we are ready to put everything together. 

Let a be the 3-vertex type whose graph is P3 with labels 1 and 2 being adjacent. Let {G n } 
converge to 4>q with |l/(G n )| = n. By Claim EDS] G n has f2(n 4 ) copies of Fq 6 J^, which denotes 
a triangle with a pendant edge. Let Fi G T\ be obtained from Fq by putting label 1 on a vertex 
of degree 2. Let F3 £ T1 be the (unique) cr-flag that can be obtained from F\ by adding labels 
2 and 3. 

Fix small positive e ^> 5. Let X = {x G F(G n ) : p(Fi, (G n ;xi)) > e}. By counting copies 
of Fq in G n , we conclude that 

2(^(fb) + o(l)) Q < |Jf| ( n 3 *) + (n - |X|) £ ( n " X ) , 

implying that |X| > 2en. Likewise, by (|3"T|) . the set Y = {y G V(G n ) : p(e, (G n ;xi)) < 1 — e} 
has size at least (1 — e)n. Thus |XnY"| > en. Similar counting shows that for every x\ 6Xn7 
there are at least 5n 2 pairs (x 2 ,x 3 ) of vertices with p(F 3 , {G n ;x\,x 2 ,xz)) > 6. 

For x\,x 2 ,xs as above, let V\ consist of all vertices of G n adjacent to all of xi,x 2 ,x 3 and 
let V 2 = V(G n ) \ Vi. We have \V X \ > 5(n - 3). Since x\ G Y, we have \V 2 \ > e(n - 1) (note 
that all non-neighbors of x\ are in V 2 ). For each non-adjacent y G Pi and z G V 2 , the 5-set 
{xi, X2, X3, y, z} contains G\, G 2 or by Lemma I4T71 By (fM)) and Claim |4"U| each of these 
graphs has density o(l) in G n . Thus there is a triple (xi, x 2 , X3) with e((5[Vi, V 2 \) = o(n 2 ). 
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Fix one such choice. By taking a subsequence, we can assume that | | / n tends to a limit a.i 
and that G n [Vj] converges to some homomorphism (pi, for i = 1, 2. Now, (po = V(0i, (p2, Qti, 02), 
where «i > 5 and «2 > £ are positive. 

Let t = 1 or 2. Each ^ is an extremal homomorphism: for example, if there is (pi with 
(p'i(p) = 4>i(p) and (fi'^Ks) < (p\{K^), then V(0' 1 , 02> «i> 02) contradicts the extremality of 0o. 
Since (po(K t +2) = and a^-i > 0, we have (pi(Kt+\) = for i = 1,2. Turan's theorem and 
the Removal Lemma (or just ©) imply that <pi(p) < 1 — j. Thus we can apply the (global) 
induction and conclude that (pi £ <E>. 

Let be the limit of complete partite graphs that have the same edge and triangle density 
as (pi and let (p' = V(0' 1; (p' 2 , a\, 02)- Then (p' is also extremal and all the claims that we proved 
for (pQ apply to (p' . In particular, (p' (Kt+2) = 0. Thus (p' Q is the limit of complete partite graphs 
with s < t + 1 parts. Assume that s > 3 for otherwise we are done. By Lemma 14.41 the parts 
have the correct ratios. We conclude that one of <p\, say (p[, has equal part sizes and thus 
(p\ = (pi. By induction, we have (p2 G < I ) . The required structure of the join <pQ routinely follows 
from Lemma 14.41 

This finishes the proof of Theorem 12.11 
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